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Abstract. 

Primordial non-Gaussianity is a potentially powerful discriminant of the 
physical mechanisms that generated the cosmological fluctuations observed today. 
Any detection of non-Gaussianity would have profound implications for our 
understanding of cosmic structure formation. In this paper, we review past and 
current efforts in the search for primordial non-Gaussianity in the large scale 
structure of the Universe. 
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1. Introduction 

In generic inflationary models based on the slow roll of a scalar field, primordial 
curvature perturbations are produced by the inflaton field as it slowly rolls down its 
potential [JJ [21 EH H] ■ Most of these scenarios predict a nearly scale- invariant spectrum 
of adiabatic curvature fluctuations, a relatively small amount of gravity waves and tiny 
deviations from Gaussianity in the primeval distribution of curvature perturbations 
[51 [HI [7] • While the latest measurements of the cosmic microwave background (CMB) 
anisotropics favor a slightly red power spectrum [5], no significant detection of a B- 
mode or of primordial non-Gaussianity (NG) has thus far been reported from CMB 
observations. 

While the presence of a £?-mode can only be tested with CMB measurements 
[SI [10] , primordial deviations from Gaussianity can leave a detectable signature in the 
distribution of CMB anisotropies and in the large scale structure (LSS) of the Universe. 
Until recently, it was widely accepted that measurement of the CMB furnish the best 
probe of primordial non-Gaussianity However, these conclusions did not take into 
account the scale-dependence of the galaxy power spectrum and bispectrum arising 
for primordial NG of the local /^l type [T2J [T5] . These theoretical results, together 
with rapid developments in observational techniques that will provide large amount of 
LSS data, will enable us to critically confront predictions of non-gaussian models. In 
particular, galaxy clustering should provide independent constraints on the magnitude 
of primordial non-Gaussianity as competitive as those from the CMB and in the long 
run may even give the best constraints. 

The purpose of this work is to provide an overview of the search for a primordial 
non-Gaussian signal in the large scale structure. We will begin by briefly summarizing 
how non-Gaussianity arises in inflationary models Next, we will discuss the 

impact of primordial non-Gaussianity on the mass distribution in the low redshift 
Universe (<|3|). The main body of this review is 21 where we describe in detail an 
number of methods exploiting the abundance and clustering properties of observed 
tracers of the LSS to constrain the amount of initial non-Gaussianity. We conclude 
with a discussion of present and forecasted constraints achievable with LSS surveys 
(©• 

2. Models and observables 

Single-field slow-roll models lead to a very small level of primordial non-Gaussianity 
[HI [HI [7] . This is because they assume i) a single dynamical field (the inflaton) ii) 
canonical kinetic energy terms (i.e. perturbations propagate at the speed of light) iii) 
slow roll (i.e. the timescale over which the inflaton field changes is much larger than the 
Hubble rate) iv) an initial adiabatic Bunch-Davis vacuum. The lowest order statistics 
sensitive to non-Gaussian features in the initial distributions of scalar perturbations 
$(x) (We shall adopt the standard CMB convention in which <£>(x) is the Bardeen's 
curvature perturbation in the matter era) is the 3-point function or bispectrum 
B$(ki, k2, ka), which is a function of any triangle ki + k2 + k3 — (as follows from 
statistical homogeneity which we assume throughout this paper). It has been shown 
that, in the squeezed limit <C fci ~ &2, the bispectrum of any single- field slow- 
roll inflationary model asymptotes to the local shape [HI [HI [17] . The corresponding 
nonlinear parameter predicted by these models is = (1 — tl s ) rj 0.017 where n s 
is the tilt or spectral index of the power spectrum P$(fc), which is accurately measured 
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to be n s ~ 0.960 ± 0.013 [5]. Therefore, any robust measurement of /Jj£ we U above 
this level would thus rule out single-field slow-roll inflation as defined above. 

2.1. The shape of the primordial bispectrum 

Large, potentially detectable amount of Gaussianity can be produced when at least 
one of the assumptions listed above is violated, i.e. by multiple scalar fields [T8[ [T9]. 
nonlinearities in the relation between the primordial scalar perturbations and the 
inflaton field [HI [7] , interactions of scalar fields [20] , a modified dispersion relation or 
a departure from the adiabatic Bunch-Davies ground state [2T] . Generation of a large 
non-Gaussian signal is also expected at reheating [22] and in the ekpyrotic scenario 
[2"5] . Each of these physical mechanisms leaves a distinct signature in the primordial 
3-point function B$(k l7 k 2 , k 3 ), a measurement of which would thus provide a wealth 
of information about the physics generating the primordial fluctuations. Although the 
configuration shape of the primordial bispectrum can be extremely complex in some 
models, there are broadly three classes of shape characterizing the local, equilateral 
and folded type of primordial non-Gaussianity [24] [25]. The magnitude of each 
template "X" is controlled by a dimensionless nonlinear parameter which we 
seek to constrain using CMB or LSS observations. 

Any non-Gaussianity generated outside the horizon induces a three-point function 
that is peaked on squeezed or collapsed triangles for realistic values of the scalar 
spectral index. The resulting non-Gaussianity depends only on the local value of the 
Bardeen's curvature potential and can thus be conveniently parameterized up to third 
order by [B HQ ^ 

$(x) = 0(x) + /> f> 2 (x) + .gj^ 3 (x) , (1) 

where 0(x) is an isotropic Gaussian random field and /^Li 9nl are dimensionless, 
phenomenological parameters. Since curvature perturbations are of magnitude 
O(10 -5 ), the cubic order correction should always be negligibly small compared to 
the quadratic one when O(f^) ~ 0(g^f j ). However, this condition is not satisfied by 
some multifield inflationary models such as the curvaton scenario, in which a large 
and a small f^ can be simultaneously produced [19]. The quadratic term generates 
the 3-point function at leading order, 

i?i° c (k 1; k 2 ,k 3 ) = 2/& [P*(*i)P*(fca) + (eye.)] , (2) 

where (eye.) denotes all cyclic permutations of the indices and P^(fc) is the power 
spectrum of the Gaussian part </>(x) of the Bardeen potential. The cubic-order terms 
generates a trispectrum T$(ki, k 2 , k 3 , k4) at leading order. 

Equilateral type of non-Gaussianity, which arises in inflationary models with 
higher-derivative operators such as the DBI model, is well describe by the factorizable 
form [37] 

Bj q (kx,k 2 ,k 3 ) = 6/° q L [-(P (fc!)P4fc 2 ) + (eye.)) - 2(P <t> (k 1 )P 4> (k 2 )P <t> (k 3 )) 2/3 (3) 

+ (P 1/3 (fci)P 2/3 (fc2)P#3) + (perm.)) 

It can be easily checked that the signal is largest in the equilateral configurations 
k\ ss fc 2 ss fc 3 , and suppressed in the squeezed limit fc 3 <C fci ~ k%. Note that, in 
single-field slow-roll inflation, the 3-point function is a linear combination of the local 
and equilateral shape [15] . 
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As a third template, we consider the folded or flattened shape which is maximized 
for k 2 ~ k 3 « fci/2 [2B] 



(^(fci)F (fc 2 ) + (eye.)) + 3(F (fc 1 )F (fc 2 )P (fc 3 ))^ 
^ 1/3 (fci)^ 2/3 (fc2)^(fc3) + (perm.))" . 



(4) 



and approximate the non-Gaussianity due to modification of the initial Bunch-Davies 
vacuum in canonical single field action (although the latter peaks on squashed or 
collinear triangles). As in the previous example, B { £ 1 is suppressed in the squeezed 
configurations. Unlike however, the folded shape induces a scale-dependent bias 
at large scales (see 



2.2. Statistics of the linear mass density field 

The Bardeen's curvature potential $(x) is related to the linear density perturbation 
#o(k, z) at redshift z through the relation 

5 (k,z) = M(k,z)$(k) , (5) 

where 

Here, T(k) is the matter transfer function normalized to unity as k — > 0, f2 m is the 
present-day matter density, D(z) is the linear growth rate normalized to 1+z. Eq.© is 
important as it provides the connection between the primeval curvature perturbations 
and the low redshift mass density field, n-point correlator of the linear matter density 
field can thus be related to those of $(x), 

(Jo(k)---£ (k„))= ^nX(fc 4 )^( < f(ki)---$(k„)) ■ (7) 

Smoothing unavoidably arises when comparing observations of the large scale structure 
with theoretical predictions. Perturbation theory (PT), which is valid only in the 
weakly nonlinear regime |29j . or the spherical collapse model, which ignores the 
strongly nonlinear internal dynamics of the collapsing regions [301 13 1 j , require that 
the small-scale nonlinear fluctuations be smoothed out. For this reason, we introduce 
the smoothed linear density field Sr, 

6 R (k, z) = M(k, z)W R (k)$(k) = M R (k, *)*(k) , (8) 

where W R (k) is a (spherically symmetric) window function of characteristic radius R 
or mass scale M. We will assume a top-hat filter in configuration space throughout. 
Furthermore, since M and R are equivalent variables, we shall indistinctly use the 
notation S R and 8m in what follows. 



2.3. Topological defects models 

In addition to inflationary scenarios, there is a whole class of models, known as 
topological defect models, in which cosmological fluctuations are sourced by an 
inhomogeneously distributed component which contributes a small fraction of the total 
energy momentum tensor [351 133] • The density field is obtained as the convolution of 
a discrete set of points with a specific density profile. Defects are deeply rooted in 
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particle physics as they are expected to form at a phase transition. Since the early 
Universe may have plausibly undergone several phase transitions, it is rather unlikely 
that no defects at all were formed. Furthermore, high redshift tracers of the LSS may 
be superior to CMB at finding non-Gaussianity sourced by topological defects [34 . 
However, CMB observations already provide stringent limits on the energy density 
of a defect component 8 a , so we shall only minimally discuss the imprint of these 
scenarios in the large scale structure. 

3. Evolution of the matter density field with primordial NG 

In this Section, we summarize a number of results relative to the effect of primordial 
NG on the mass density field. These will be useful to understand the complexification 
that arises when considering biased tracers of the density field (see Q • 

3.1. Setting up non- Gaussian initial conditions 

Investigating the impact of non-Gaussian initial conditions (ICs) on the large scale 
structure traced by galaxies etc. requires simulations large enough so that many long 
wavelength modes are sampled. At the same time, the simulations should resolve the 
dark matter halos hosting the observed galaxies or quasars (QSOs), so that one can 
construct halo samples whose statistical properties mimic as closely as possible those 
of the real data. This favors the utilization of pure N-body simulations, for which a 
larger dynamical range can be achieved, rather than computationally more expensive 
hydrodynamical simulations. 

The evolution of the matter density field with primordial non-Gaussianity has 
been studied in series of large cosmological N-body simulations seeded with Gaussian 
and non-Gaussian initial conditions, see e.g. (33 ESI EB HO EH \M HQ [13j H2 HH g4] . 
For generic non-Gaussian (scalar) random fields, we face the problem of setting up 
numerical simulations with a prescribed correlation structure [45] . For the equilateral 
and folded type of non-Gaussianity, this task is not easily accomplished (because 
it requires the calculation of a number of convolutions which are computationally 
demanding). However, for primordial NG described by a local mapping such as the 
/jvjL model, this is a rather straightforward operation. This is the reason why most 
numerical studies of structure formation with inflationary NG have so far implemented 
the local shape solely. 

3.2. Mass density probability distribution 

In the absence of primordial NG, the probability distribution function (PDF) of the 
initial smoothed density field, i.e. the probability that a randomly placed cell of 
volume V has some specific density, is Gaussian. Namely, all normalized or reduced 
smoothed cumulants Sj of order J > 3 are zero. An obvious signature of primordial 
NG would thus be an initially non-vanishing skewness S3 — (#Jj)c/(#ij) 2 or kurtosis 
S 4 = (Sfyc/iSx) 3 - 3/(6%) [311 ESI S3]- Here, the subscript c denotes the connected 
piece of the n-point moment that cannot be simplified into a sum over products of 
lower order moments. At third order for instance, the cumulant of the smoothed 
density field is an integral of the 3-point function, 

fd 3 k 1 f d 3 k 2 fd 3 k 3 

(4)= y(w/(wy(^ Bii(ki ' k2 ' k3 ' z) ' (9) 
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where 

B H (ki,k 2 ,k3, z) = M R (k 1 ,z)M R {k2,z)M R (k 3 , z)B 9 (k 1 ,k. 2 ,k 3 ) (10) 

is the bispectrum of the smoothed linear density fluctuations at redshift z. Note 
that, while S^i?, z) oc D(z)~ 1 , the product erS^i?) does not depend on redshift. 
Over the range of scale 0.1 < R < 100 /i~ 1 Mpc accessible to LSS observations, 
aSjj(R) = aSs(R, = 1) is a weakly monotonically decreasing function of i? that 
is of magnitude ~ 10 -4 for the local, equilateral and folded templates discussed above. 
Strictly speaking, all reduced moments should be specified to fully characterise the 
density PDF, but a reasonable description of the density distribution can be achieved 
with moments up to the fourth order. 

Numerical and analytic studies generally find that a density PDF initially skewed 
towards positive values produces more overdense regions, whereas a negatively skewed 
distribution produces larger voids. Gravitational instabilities also generate a positive 
skewness in the density field, reflecting the fact that the evolved density distribution 
exhibits an extended tail towards large overdensities [551 HS1 EH 153 [521 G>3]. This 
gravitationally-induced signal eventually dominates the primordial contribution such 
that, at fixed normalization amplitude, non-Gaussian scenarios deviate more strongly 
from the fiducial Gaussian model at high redshift. More precisely, the time evolution 
of the normalized cumulants Sj can be worked out for generic Gaussian and non- 
Gaussian ICs using PT, or the simpler spherical collapse approximation. For Gaussian 
ICs, PT predicts that the normalized cumulants be time-independent to lowest non- 
vanishing order, with a skewness S3 rs 34/7, whereas for non-Gaussian ICs, the linear 
contribution to the cumulants decays as Sj(R, z) = Sj(R, oo)/D J ~ 2 (z) [54, 551 1561 157] , 

The persistence of the primordial hierarchical amplitude Sj(R, 00) sensitively 
depends upon the magnitude of Sn, N > J, relative to unity. For example, an 
initially large non-vanishing kurtosis could source skewness with a time-dependence 
and amplitude similar to that induced by nonlinear gravitational evolution [54J. 
Although there is an infinite class of non-Gaussian models, we can broadly divide 
them into weakly and strongly non-Gaussian. In weak NG models, the primeval 
signal in the normalized cumulants is rapidly obliterated by gravity-induced non- 
Gaussianity. This is the case of hierarchical scaling models where n-point correlation 
functions satisfy £„ oc w ^h £2 1 at large scales. By contrast, strongly NG 

initial conditions dominate the evolution of the normalized cumulants. This occurs 
when the hierarchy of correlation functions obeys the dimensional scaling £„ oc '' , 
which arises in the particular case of \ 2 initial conditions [58] or in defect models 
such as texture 59, 38, 60 . These scaling laws have been successfully confronted with 
numerical investigations of the evolution of cumulants |38[ 139] . 

Although gravitational clustering tends to erase the memory of initial conditions, 
numerical simulations of non-Gaussian initial conditions show that the occurrence of 
highly underdense and overdense regions is very sensitive to the presence of primordial 
NG. In fact, the imprint of primordial NG is best preserved in the low density tail of the 
PDF P(p\R) of the evolved density field p smoothed at scale R [3TJ[5T]. A satisfactory 
description of this measurement can be obtained from an Edgeworth expansion of the 
initial mass density field. At high densities p 1, the non-Gaussian modification 
approximately scales as p 3 / 5 whereas, at low densities p ~ 0, the deviation is steeper 
and behaves as p 6//5 [62] . 
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3.3. Power spectrum and bispectrum 



Primordial non-Gaussianity also imprints a signature in Fourier space statistics of the 
matter density field as positive values of /nl tend to increase the small scale matter 
power spectrum Ps(k) [TSJ (4TJ [63] and the large scale matter bispectrum -Ba(ki, k 2 , k 3 ) 
[121164]. 

In the weakly nonlinear regime where 1-loop PT applies, the Fourier mode of the 
density field for growing-mode initial conditions reads [H2 [55] 

d 3 q 1 d 3 q 2 



S(k,z) = S a (k,z) 



■<5 D (k-qi - q 2 )F 2 (qi,q2)<5o(qi,z)<5o(q2,20 . (11) 



(2tt) 3 (2tt) 3 

The kernel F 2 (ki, k 2 ) = 5/7 + /i(fci/fc 2 + k 2 /ki)/2 + 2fi 2 /7, where fi is the cosine of the 
angle between ki and k 2 , describes the nonlinear 2nd order evolution of the density 
field. It is nearly independent of f2 m and J7a and vanishes in the (squeezed) limit 
ki = — k 2 . At 1-loop PT, the second term in the right-hand side of Eq. (fTTj) generates 
a correction to the mass power spectrum, 

F 5 NG (fc, z) = P (k, z) + [P (22) (fc, z) + P {13) (k, z)] + A/f G (fc, z) . (12) 

Here, Pa(k) is the linear matter power spectrum at redshift z, -P( 22 ) and -P(i 3 ) are the 
standard one- loop contributions in the case of Gaussian ICs [65 , 66 , and 

d 3 q 



(k,z) = 2 



-F 2 (q, k - q)B (-k, q, k - q, z) 



(13) 



is the correction due to primordial NG [63]. This last term scales as D 3 (z) such that 
the effect of non-Gaussianity is largest at low redshift. Most importantly, F 2 (ki,k 2 ) 
vanishes in the limit ki = — k 2 as a consequence of the causality of gravitational 
instability. This strongly suppresses Eq. (|13[) at small wavenumbers, even in the local 
/jvjL model for which B (— k, q, k — q) is maximized in the squeezed limit |k| — > 0. For 
/nl ~ O(10 2 ), the magnitude of the correction is at a per cent level in the weakly 
nonlinear regime k < 0.1 /iMpc -1 , in good agreement with simulations [121 HU 157] . 
Extensions of the renormalization group description of dark matter clustering [68] to 
non-Gaussian initial density and velocity perturbations can improve the agreement up 
to wavenumbers k < 0.25 /iMpc -1 [691170] . 

To second order in PT, the matter bispectrum Bs(ki,k2,ks) is the sum of a 
primordial contribution and of two terms induced by gravitational instability |49[ 171] 
(Here and henceforth we omit the explicit z-dependence for brevity), 

B 5 (k!,k 2 ,k3) = B (ki,k 2 ,k 3 ) + [2F 2 (k 1 ,k 2 )P (fci)Po(fc 2 ) + (eye. 

d 3 q 



(2tt) 3 



F 2 (q, k 3 - q)T (q,k 3 - q,ki,k 2 ) + (eye.) 



(14) 



where To(ki, k 2 , k 3 , k4) is the primordial trispectrum of the density field. Note that 
a similar expression can be derived for the matter trispectrum, which turns out to be 
less sensitive to gravitationally induced nonlinearities [72] . The reduced bispectrum 
Q 3 is conveniently defined as 

^(ki.k^ka) 



<3 3 (ki,k 2 ,k 3 ) = j 



P S {k 1 )P s {k 2 ) + cyclic 



(15) 



For Gaussian initial conditions, Q 3 is independent of time and, at tree-level PT, is 
constant and equal to Q 3 (fc, k, k) = 4/7 for equilateral configurations [49]. For general 
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triangles moreover, it approximately retains this simple behavior, with a dependence 
on triangle shape through F 2 (k 1 ,k 2 ) [12]. The inclusion of 1-loop corrections greatly 
improves the agreement with the numerical data [73 . An important feature property 
of the matter bispectrum is the fact that the primordial part scales as Q3 oc 1/ Aia(k) 
for approximately equilateral triangles and under the assumption that is scale- 
independent [12]. This "anomalous" scaling considerably raises the ability of the 
matter bispectrum to constrain primordial NG of the local /^l type- Unfortunately, 
neither the matter bispectrum nor the power spectrum are directly observable with 
the large scale structure of the Universe. Temperature anisotropics in the redshiftcd 
21cm background from the pre-reionization epoch could in principle furnish a direct 
measurement of these quantities [741 1751 176) . but foreground contamination may 
severely hamper any detection. Weak lensing is another direct probe of the dark 
matter, although we can only observe it in projection along the line of sight |77| . 

As we will see shortly however, a similar scaling is also present in the power 
spectrum and bispectrum of observable tracers of the large scale structure such 
as galaxies. It is this unique signature that will make future all-sky LSS surveys 
competitive with forthcoming CMB experiments. 

4. LSS probe of primordial non-Gaussianity 

Discrete and continuous tracers of the large scale structure such as galaxies, the 
Lya forest, the 21cm hydrogen line etc., provide a distorted image of the matter 
density field. In CDM cosmologies, galaxies form inside overdense regions [75] and 
this introduces a bias between the matter and galaxy distributions [79) . As a result, 
distinct samples of galaxies trace the matter distribution differently, the most luminous 
galaxies preferentially residing in the most massive DM halos. This biasing effect, 
which concerns most tracers of the large scale structure, is still poorly understood. 
Models of galaxy clustering assume for instance that the galaxy biasing relation only 
depends on the local mass density, but the actual biasing could be more complex 
[501 151] . Because of biasing, tracers of the large scale structure will be affected by 
primordial non-Gaussianity in a different way than the mass density field. In this 
Section, we describe a number of methods exploiting the abundance and clustering 
properties of biased tracers to constrain the level of primordial NG. We focus on galaxy 
clustering as it provides the tightest limits on primordial NG (see jjSJ). 

Jf.,1. Halo finding algorithm 

Locating groups of bound particles, or DM halos, in simulations is central to the 
methods described below. In practice, one aims at extracting halo catalogs with 
statistical properties similar to those of observed galaxies or QSOs. This, however, 
proves to be quite difficult because the relation between observed galaxies and halos 
is somewhat uncertain. Furthermore, there is freedom at defining a halo mass. 

A important ingredient is the choice of the halo identification algorithm. There are 
two categories of halo finder: i) spherical overdensity (SO) finder [82] with overdensity 
threshold A v j r (z) ~ 200 and ii) Friends-of- Friends (FOF) finder with a linking length 
b ~ 0.15 — 0.2 83 . The mass of a SO halo is defined by the radius at which the inner 
overdensity exceeds A vil (z) times the background density p(z), whereas the mass of 
a FOF halo is the number of linked particles. Here, we will present results mostly 
for SO halos, as their mass estimate is more closely connected to the predictions of 
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the spherical collapse model, on which most of the analytic formulae presented in 
this Section are based. The question of how the spherical overdensity masses can 
be mapped onto friends-of-friends masses remains a matter of debate [84j . Clearly 
however, since the peak height depends on the halo mass M through the variance 
cr(M), any systematic difference will be reflected in the value of v associated to a 
specific halo sample. 

4-2. Abundances of voids and bound objects 

A departure from Gaussianity can significantly affect the abundance of highly biased 
tracers of the mass density field, as their frequency sensitively depends upon the tails 
of the initial density PDF 85, 86, 87 . The (extended) Press-Schechter approach has 
been extensively applied to ascertain the magnitude of this effect. Because it depends 
only on the skewness, it is weakly sensitive to the shape of the primordial bispectrum. 

4-2.1. Press-Schechter approach The Press-Schechter theory [88] and its extentions 
based on excursion sets [89l [90l E] predict that the number density n(M, z) of halos 
of mass M at redshift z is entirely specified by a multiplicity function f{v), 

P , x d In v , . 

where the peak height v{M, z) = S c (z) j a(M) is the typical amplitude of fluctuations 
that produce those halos. Here and henceforth, a(M) denotes the variance of the 
initial density field 5m smoothed on mass scale M oc fl 3 and linearly extrapolated 
to present epoch, whereas S c (z) ~ 1.68D(0) / D(z) is the critical linear overdensity for 
(spherical) collapse at redshift z. In the standard Press-Schechter approach, n(M, z) 
is related to the level excursion probability P(> S C ,M) that the linear density contrast 
of a region of mass M exceeds 5 c (z), 

/(*)-»&£-,/§».-*"• (17) 
where the last equality assumes Gaussian initial conditions. The factor of 2 is 
introduced to account for the contribution of low density regions embedded in 
overdensities at scale > M. In the extended Press-Schechter theory, 8m evolves with 
the mass scale M and f(v) is the probability that a trajectory is absorbed by the 
constant barrier 8 — S c (as is appropriate in the spherical collapse approximation) on 
mass scale M. In general, the exact form of f{v) depends on the barrier shape [9"2"] 
and the filter shape [93]. Note also that J dhiv f(y) — 1, which ensures that all the 
mass is contained in halos. 

Despite the fact that the Press-Schechter mass function overpredicts (underpre- 
dicts) the abundance of low (high) mass objects, it can be used to estimate the frac- 
tional deviation from Gaussianity. In this formalism, the non-Gaussian fractional 
correction to the multiplicity function is R(v, /^ L ) = f{v, f^ L )/f{^, 0) = (dP/dM)(> 
8 c ,M,f^ L )/(dP/dM)(> S C ,M, 0), which is readily computed once the non-Gaussian 
density PDF P(5m) is known. In the simple extensions proposed by [M] and |95| . 
P(8m) is expressed as the inverse transform of a cumulant generating function. In 
[9"5"] . the saddle-point technique is applied directly to P(Sm)- The resulting Edge- 
worth expansion is then used to obtain P(> S c , M). Neglecting cumulants beyond the 
skewness, one obtain 

Kv(-, /nl) * 1 + I *fh (- 3 - 3.) - (u I) (18) 
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after integration over regions above S c (z). In [94) . it is the level excursion 
probability P(> S c , M) that is calculated within the saddle-point approximation. This 
approximation asymptotes to the exact large mass tail, which exponentially deviates 
from the Gaussian tail. To enforce the normalization of the resulting mass function, 
one may define i/* = 5* /a with 5* = 5 c yl — Sz5 c /3, and use [94l [96] 



r2 n NG {M,z)dlnM 



v*f{v*) = M NGV _ ' ' - . (19) 

p d In v+ 



The fractional deviation from the Gaussian mass function then becomes 

uv dSs v± 



Rmvj 0> /nl) ~ ex P ( Y aS3 



6v+ d In v v 



(20) 



Both formulae have been shown to give reasonable agreement with numerical 
simulations of non-Gaussian cosmologies [97J [42] [98] (but note that 99, [13] have 
reached somewhat different conclusions). Expanding 6* — 5 C yl- S^S^/S at the first 
order shows that these two theoretical expectations differ in the coefficient of the vaS^ 
term. Therefore, it is also instructive to consider the approximation |100j 



r (vJnl) ~ ex P( ~Q aSi 



v v d{aS 3 ) 

2 ab3 ~ 6^1nV 



(21) 



which is designed to match better the Edgeworth expansion of [55] when the peak 
height is v ~ 1. Note that, when the primordial trispectrum is large (which, in the 
local model, would happen if ~ 10 6 ), terms involving the kurtosis should also be 
included (94] [95] I100| 1101] . In this case, it is also important to take into account a 
possible renormalization of the fluctuation amplitude, as — > as + 5as, to which the 
high mass tail of the mass function is exponentially sensitive jlOOj . Finally, note also 
that [13j [43] parametrize the fractional correction using N-body simulations. 



Figure [T] shows the effect of primordial NG of the local type on the halo mass 
function. The dotted-dashed curve represents the approximation Eq. (|2"Tj) . While 
the agreement is reasonable for the SO halos (top panel), the theory significantly 
overestimates the deviation measured in the FOF mass function with linking length 
b = 0.2 (middle panel). A similar effect is noted in [OS], who makes the replacement 
o~c 3c^/q with q 0.75 to fit their measurement of R{v, /nl) based on FOF 
halos. References |102l 1103] provide a physical motivation of this replacement by 
demonstrating that the diffusive nature of the collapse barrier introduces a similar 
factor. However, an overlooked but important fact is that the FOF and SO mass 
estimates systematically deviate from each other. In FigQ] in particular, the FOF 
mass is on average 20% larger than the SO mass. As shown in the bottom panel 
of Fig. [T] rescaling the FOF mass to account for this difference removes most of the 
discrepancy with the FOF data. This illustrates an important point: the impact of 
primordial NG on the statistics of DM halos is sensitive to systematics caused by 
the choice of the halo finder. As we will see below, this may also be true for the 
non-Gaussian halo bias. 

More sophisticated formulations based on extended Press-Schechter (EPS) theory 
and/or modifications of the collapse criterion look promising since they can reasonably 
reproduce both the Gaussian halo counts and the dependence on [102|, 11041 1105] . 
The probability of first upcrossing can, in principle, be derived for any non-Gaussian 
density field and any choice of smoothing filter [106] 1107] . For a general filter, the 
non-Markovian dynamics generates additional terms in the non-Gaussian correction 
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Figure 1. Fractional deviation from the Gaussian mass function as a function 
of the peak height v = S c /cr. Different symbols refer to different redshifts as 
indicated. The solid curve is the theoretical prediction Eq. (]_' I [ i at z = based 
on an Edgeworth expansion of the dark matter probability distribution function. 
In the top panel, halos are identified using a spherical overdensity (SO) finder 
with a redshift-dependent overdensity threshold A v i r (z). In the middle panel, a 
Fricnds-of-Fricnds (FOF) finding algorithm with linking length b = 0.2 is used. 
The bottom panel shows the effect of decreasing the FOF mass by 20% (see text). 
In all panels, error bars denote Poisson errors. For illustration, M = 10 15 Mq//i 
corresponds to v = 3.2, 5.2, 7.7 at redshift z = 0, 1 and 2, respectively. Similarly, 
M = 10 14 M Q /h and 10 13 M G /h correspond to V = 1.9, 3, 4.5 and 1.2, 1.9, 2.9 
respectively. 



to the mass function that arise from 3-point correlators of the smoothed density 5m 
at different mass scales [102] . However, large error bars still make difficult to test 
for the presence of such sub-leading terms. For generic moving barriers B{o~) such as 
those appearing in models of triaxial collapse [1081 1109] , the leading contribution to 
the non-Gaussian correction approximately is [104] 

R{v, /nl) ~ 1 + \ ^3 H 3 (^M) , (22) 
where H^{y) = and agrees well with the observed deviation [105] . 

4-2.2. Clusters abundance Rich clusters of galaxies trace the rare, high-density peaks 
in the initial conditions and thus offer the best probe of the high mass tail of the 
multiplicity function. To infer the cluster mass function, the X-ray and millimeter 
windows are better suited than the optical-wave range because selection effects can 
be understood better. 

Following early theoretical QTD1 US [TTT1 EH [112] and numerical [TT3l fll4l l36l ITT5] 
work on the effect of non-Gaussian initial conditions on the multiplicity function 
of cosmic structures, the abundance of clusters and X-ray counts in non-Gaussian 
cosmologies has received much attention in the literature. At fixed normalization 



Primordial non-Gaussianity from the large scale structure 



12 



of the observed abundance of local clusters, the proto-clusters associated with high 
rcdshift (2 < z < 4) Lycn emitters are much more likely to develop in strongly 
non-Gaussian models than in the Gaussian paradigm 11161 155] . Considering 
the redshift evolution of cluster abundances can thus break the degeneracy between 
the initial density PDF and the background cosmology. In this regards, simple 
extensions of the Press-Schechter formalism (similar to those considered above) have 
been shown to capture reasonably well the cluster mass function over a wide range 
of redshift for various non-Gaussian scenarios [117] . Scaling relations between the 
cluster mass, X-ray temperature and Compton y-parameter calibrated using theory, 
observations and detailed simulations of cluster formation [1181 1119] . can then be 
exploited to predict the observed distribution functions of X-ray and SZ signals and 
assess the capability of cluster surveys to test the nature of the initial conditions 
[I2DllI2Il[I22llI55l[mi[I25llI2BllI27llI55| . 

An important limitation of this method is that, for a realistic amount of primordial 
NG, the non-Gaussian signal imprinted in cluster abundances is small compared to 
the systematics plaguing current and upcoming surveys [129( 11301 1131] . Given the 
current uncertainties in the redshift evolution of clusters (one commonly assumes that 
clusters are observed at the epoch they collapse [130] ). the selection effects in the 
calibration of X-ray and SZ fluxes with halo mass, the freedom in the definition of 
the halo mass, the degeneracy with the normalization amplitude as (for positive /^lj 
the mass function is more enhanced at the high mass end, and this is similar to an 
increase in the amplitude of fluctuations cr 8 132 ) and the low number statistics, the 
prospects of using the cluster mass function only to place competitive limits on 
with the current data are small. A two-fold improvement in cluster mass calibration 
is required to provide constraints comparable to CMB measurements [131] . 



4-2.3. Voids abundance The frequency of cosmic voids, which is strongly sensitive 
to the low density tail of the initial mass PDF, offers another probe of non-Gaussian 
initial conditions [133] . The Press-Schechter formalism can also be applied to ascertain 
the magnitude of this effect. Voids are defined as regions of mass M whose density 
is less than some critical value 6 V < or, alternatively, as regions for which the 
three eigenvalues of the tidal tensor |134) lie below some critical value X v < 
[1331 1521 11351 1105] . An important aspect in the calculation of the mass function 
of voids is the over-counting of voids located inside collapsing regions. This voids-in- 
clouds problem, as identified by [136] ). can be solved within the excursion set theory 
by studying a two barriers problem: S c for halos and S v for voids. Including this 
effect reduces the frequency of the smallest voids [105] . Neglecting this complication 
notwithstanding, the differential number density of voids of radius R is [133( 1135] 



dn 9 pK \v v \ _ l/ 2^ 2 <iln| 



dR 2ir 2 V 2 R 4 dhxM 



l- jUs 3 # 3 (kl) 



(23) 



where v v — 6 v /(Jm- While a positive produces more massive halos, it generates 
fewer large voids 133, 105 . Hence, the effect is qualitatively different from a simple 
rescaling of the normalization amplitude a%. A joint analysis of both abundances of 
clusters and cosmic voids might thus provide interesting constraints on the shape of 
the primordial 3-point function. There are, however, several caveats to this method, 
including the fact that there is no unique way to define voids [1331 1137] . Clearly, voids 
identification algorithms will have to be tested on numerical simulations [138] before 
a robust method can be applied to real data. 
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4-3. Galaxy 2-point correlation 

In Gaussian cosmologies, correlations of galaxies and clusters can be amplified relative 
to the mass distribution |79) . Before this was realized, it was argued that primeval 
fluctuations need to be non-Gaussian [1391 1140] to explain the observed strong 
correlation of Abell clusters [141[ 1142] . Along these lines, |143| pointed out that 
primordial non-Gaussianity could significantly increase the amplitude of the two-point 
correlation of galaxies and clusters on large scales, However, except from 144 who 
showed that correlations of high density peaks in non-Gaussian models are significantly 
stronger than in the Gaussian model with identical mass power spectrum, subsequent 
work focused mostly on abundances f £)4.2p or higher order statistics such as the 
bispectrum ( 34.4[) . It is only recently that [T3] have demonstrated the strong scale- 
dependent bias arising in non-Gaussian models of the local type. 

4-3.1. The non-Gaussian bias In the original derivation of [13], the Laplacian is 
applied to the left and right hand side of $ = (f> + f^l^ 2 to show that, upon 
substitution of the Poisson equation, the overdensity in the neighborhood of density 
peaks is spatially modulated by a factor proportional to the local value of fa Taking 
into account the coherent motions induced by gravitational instabilities, the scale- 
dependent bias correction reads 

Ab K (k,f$£) = 3ftW - 1]^) ^ , (24) 

where b\(M) is the linear, Gaussian halo bias. The original result of [13] missed out 
a multiplicative factor of T(k)~ 1 which was introduced subsequently by |145) upon 
a derivation of Eq. (pM)) in the limit of high density peaks. The peak-background 
split approach [1461 11471 l9"2"j promoted by |148] shows that the scale-dependent bias 
applies to any tracer of the matter density field whose (Gaussian) multiplicity function 
depends on the local mass density only. In this approach, the Gaussian piece of the 
potential is decomposed into short- and long- wavelength modes, <j> = fa + fa,. This 
provides an intuitive explanation of the effect in terms of a local rescaling of the small- 
scale amplitude of matter fluctuations by a factor 1 + 2f^£fa (see also [TBI 11491 Wf\ ). 
As emphasized in [13] . the scale-dependence arises from the fact that the non-Gaussian 
curvature perturbations "^(x) are related to density fluctuations through the Poisson 
equation ([5]). There is no such effect in the x 2 model [1501 ITT) nor in texture-seeded 
cosmologies [151] for instance. 

The derivation of [145] , based on the clustering of regions of the smoothed density 
field 5m above threshold S c (z), is formally valid for high density peaks only. However, 
it is general enough to apply to any shape of primordial bispectrum [152] . In the high 
threshold limit v ^> 1, the 2-point correlation function of the level excursion set can 
be expressed as |110j 

{oo n— 1 „ _„ / s ^ 

rdn Xl .'::' ,Xl ' , X2 'v;;. X2 ,* = oH,(25) 
2 1 3-( n ~ j)- V J t imes (n — j) times / J 

where r = Xi — x 2 . For most non-Gaussian models in which the primordial 3- 
point function is the dominant correction, this expansion can be truncated at the 
third order and Fourier transformed to yield the non-Gaussian correction AP>„(fc) 
to the power spectrum. Assuming a small level of primordial NG, we can also write 
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AP >v (k) w 2b^Ab K Pji(k), where b^ = &i(M) -1» v 2 /S c is the Lagrangian bias, and 
eventually obtain 

Ab B (fc,/*j = b^kmkjD = ( m^q) )^'^ • (26) 

The dependence on the shape of the 3-point function is encoded in the function 

m H5zi, 

^'/nl) = ^^J™dk 1 %M a (k 1) Q) J + \m r {^, 0) B * ( p^ k) , (27) 

where a 2 = k 2 + k\ + 2/ifcfci. Note that, for < 0, this fist order approximation 
always breaks down at sufficiently small k because AP >l/ {k) < 0. 

The scale-dependent bias induced by the equilateral and folded bispectrum shape 
is computed in |152j . To get insights into the large scale behavior of Ab K (k, /J L ), 
let us identify the dominant contribution to F(k : /^l) m ^ ne limit fc <C 1- Setting 
■M. r(*/cx, 0) ~ A^i?(fci,0) and expanding P^( v / a) at second order in the ratio k/k\, 
we arrive at 

m/NL C )«/NL C (28a) 
HkJZ) « JnI [aE^^^)*^^ + i (n s -4)S fl (-2)fc 2 ] a- 2 (286) 

- lf^ R (^)k 4 -^o- 2 , (28c) 

assuming no running scalar index, i.e. dn s /d\nk — 0. The auxiliary function HJji) 
is defined as 

S» = / dk fc( 2+ ") 7W fl (fc, 0) 2 P4fc) . (29) 
27r Jo 

Hence, we have S H (0) = a 2 . For a nearly scale- invariant spectrum n s w 1, we obtain 
T(k,f^l) cx fc and J-(k,f^) cx fc 2 , such that the non-Gaussian bias is A6 K oc fc -1 
and A6 re =const. for the folded and equilateral bispectrum, respectively. Therefore, 
at large scales, the scale-dependence of the non-Gaussian bias is much smaller for 
the folded template, and nearly absent for the equilateral shape. This make them 
much more difficult to detect with galaxy surveys |152j . However, the equilateral 
and folded non-Gaussian bias depend sensitively upon the mass scale M through 
the multiplicative factor er~ 2 . For example, choosing R = 5 /i _1 Mpc instead of 
R = 1 /i Mpc would increase the effect by a factor of ~ 3. In the high peak 
limit, er~ 2 m bt,/S c (z), which cancels out the dependence on redshift but enhances 
the sensitivity to the halo bias, i.e. Ab K cx By contrast, Ab K cx 6l for the local 
model. 

At this point, it is appropriate to mention a few caveats to these calculations. 
Firstly, Eq. (|26|) assumes that the tracers form after a spherical collapse, which may 
be a good approximation for the massive halos only. If one instead considers the 
ellipsoidal collapse dynamics, in which the evolution of a perturbation depends upon 
the three eigenvalues of the initial tidal shear, S c (0) should be replaced by its ellipsoidal 
counterparts <5 OC (0) which is always larger than the spherical value j!08j . In this model, 
the scale-dependent bias Ab K is thus enhanced by a factor £ ec (0)/£ c (0) |149j . Secondly, 
Eq. ([26]) assumes that the biasing of the surveyed objects is described by the peak 
height v only, or equivalently, the hosting halo mass M. However, this may not be true 
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for quasars whose activity may be triggered by merger of halos [1531 1154) . Reference 
[148] used the EPS formalism to estimate the bias correction A6 merger induced by 
recent mergers, 

A6 mcrgcr = S- 1 , (30) 

so the factor 6i(M) - 1 should be replaced by h(M) - 1 - S^ 1 « b x (M) - 1.6. The 
validity of this result should be evaluated with cosmological simulations of quasars 
formation. In this respect, semi-analytic models of galaxy formation suggest that 
merger-triggered objects such as quasars do not cluster much differently than other 
tracers of the same mass |155j . However, this does not mean that the same should 
hold for the non-gaussian scale dependent bias. Still, since the recent merger model is 
an extreme case it seems likely that the actual bias correction is < A& merge r < • 
Thirdly, the scale-dependent bias has been derived using the Newtonian approximation 
to the Poisson equation, so one may wonder whether general relativistic (GR) 
corrections to Ai^k) -1 suppress the effect on scales comparable to the Hubble radius. 
Reference |156] showed how large scale primordial NG induced by GR corrections 
propagates onto small scales once cosmological perturbations reenter the Hubble radius 
in the matter dominated era. This effect generates a scale-dependent bias comparable, 
albeit of opposite sign to that induced by local NG [152] . However, this issue deserves 
further clarification as |157j have recently argued that there are no GR corrections to 
the non-Gaussian bias and that the scaling Ab K oc k~ 2 applies down to the smallest 
wavenumbers. 

Finally, we can also ask ourselves whether higher-order terms in the series 
expansion (|25p furnish corrections to the non-Gaussian bias of magnitude comparable 
to Eq. (f24|) . In the /^l model, the power spectrum of biased tracers of the density 
field can also be obtained from a local Taylor series in the evolved (Eulerian) density 
contrast 6 and the Gaussian part <j> of the initial (Lagrangian) curvature perturbation 
|158[ 167] . Using this approach, it can be shown that the halo power spectrum arising 
from the first order terms of the local bias expansion can be cast into the form [158 

P h (k) = [h(M) + 2 P R (k) (31) 

Hence, we obtain a second order term proportional to {f)$i J ) 2 M-~fi' Pii{k) = 
(/nl) 2 ^V(^) which, however, contributes only at very small wavenumber k < 
0.001 h~ 1 Mpc. There is another second order correction to the halo power spectrum 
that reads [100] 

APh(fc) = j(fk°l) 2 [bi(M) - l\ 2 S 2 c (z)S { 3 1 \M)M R (k,0)P 4> (k) . (32) 

Its magnitude relative to the term linear in f^£, Eq. (f24"|) . is approximately 0.03 at 
redshift z — 1.8 and for a halo mass M — 10 13 M@/h. Although this contribution 
becomes increasingly important at higher redshift, it is fairly small for realistic values 
of /nl- All this suggests that Eq. ([24)) is the dominant contribution to the non- 
Gaussian bias in the wavenumber range 0.001 < k < 0.1 /iMpc -1 . 



4-3.2. Comparison with simulations In order to fully exploit the potential of 
forthcoming large scale surveys, a number of studies have tested the theoretical 
prediction Eq.([24|) against the outcome of large numerical simulations [13j [42J 143} 

EHIISIISZI- 

At the lowest order, there are two additional albeit relatively smaller corrections to 
the Gaussian bias which arise from the dependence of both the halo number density 
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n(M,z) and the matter power spectrum Pg{k,z) on primordial NG 42 . Firstly, 
assuming the peak-background split holds, the change in the mean number density of 
halos induces a scale-independent offset which we denote Abi(f^f). In terms of the 
non-Gaussian fractional correction R(v, /^l) to the mass function, this contribution 
is 



It is worth noticing that A6i(/^£) has a sign opposite to that of /^lj because the bias 
decreases when the mass function goes up. Secondly, we also need to account for the 
change in the matter power spectrum (see 331). As a result, non-Gaussianity of the 
/nl type adds a correction Ab(k, /^l) to the bias b(k) of dark matter halos that reads 



at first order in fl?£. The linear bias b\ needs to be measured accurately as it controls 
the strength of the scale-dependent bias correction Ab K . In this respect, the ratio 
Phs(k)/ P$(k), where Phs{k) is the halo-matter cross power spectrum, is a better proxy 
for the halo bias as it is less sensitive to shot-noise. 

Reference 42 find that the inclusion of these extra terms substantially improves 
the comparison between the theory and the simulations. Considering only the scale- 
dependent shift Ab K leads to an apparent suppression of the effect in simulations 
relative to the theory. Including the scale-independent offset A&i considerably 
improves the agreement at all scales. Finally, adding the scale-dependent term 
bi(M)(P mm (k, f^)/ P mm (k,0) — 1) further adjusts the match at small scale k > 
0.05 /iMpc^ 1 by making the non-Gaussian bias shift less negative. Taking into account 
second- and higher-order corrections could extend the validity of the theory up to scales 
fe~0.1-0.3 /iMpc" 1 [67]. 

Auto- and cross-power analyses may not agree with each other if the halos and 
dark matter do not trace each other on scale k < 0.01 /iMpc" 1 where the non-Gaussian 
bias is large, i.e. if there is stochasticity. Fig[2]shows Phs(k) and Ph(k) averaged over 8 
realisations of the models with — 0, ±100. The same Gaussian random seed field tj> 
was used in each set of runs so as to minimize the sampling variance. Measurements of 
the non-Gaussian bias correction obtained with the halo-halo or the halo-matter power 
spectrum are in a good agreement with each other, indicating that non-Gaussianity 
does not induce stochasticity and the predicted scaling Eq. (1231) applies equally well for 
the auto- and cross-power spectrum. However, while a number of numerical studies of 
the model have confirmed the scaling Ab K (k, fj$£) <x VWfl(fc) -1 and the redshift 
dependence oc D{z)~ l [131 EH SSI HE] , the exact amplitude of the non-Gaussian bias 
correction remains somewhat debatable. Reference [32], who use SO halos and include 
the scale- independent offset A&i, find satisfactory agreement with the theory. By 
contrast, [98, 33], who use FOF halos, find that Eq. (|2"3[) is a good fit to the simulations 
only upon replacing 5 C by q5 c with q ~ 0.75. Part of the discrepancy may be due to 
the fact that (98j [43] do not include A&i, which leads to an apparent suppression of 
the effect. Another possible source of discrepancy may be choice of the halo finder. In 
this regards, Fig. [3] shows the non-Gaussian bias correction obtained with FOF halos. 
The best-fit values of /^l are somewhat below the input values of ±100, in agreement 
with the findings of [SHI H3] • This indicates that the choice of halo finder also affects 
the magnitude of the non-Gaussian halo bias. Discrepancies have also been observed 
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Figure 2. Halo-halo and halo-matter power spectra Ph(k) and Phs(k) measured 
in simulations of the Gaussian model and of the local /Sp type with /iffj = ±100. 
Halos of mass M > 2 x 10 13 M //i were identified at rcdshift 2 = 2 with a 
SO finder. The linear Gaussian bias of this sample is b\(M) = 2.53. The 
error bars represent the scatter among 8 realizations. The solid and dashed 
curve show the theoretical Ph(k) and Phs(k) obtained wih the non-Gaussian bias 
correction Eq. 1341 . For /i?^ = —100, the cross-power spectrum is negative on 
scales k < 0.005 /iMpc -1 , in good agreement with the theoretical prediction. 



between the theoretical and measured non- Gaussian bias corrections in non- Gaussian 
models of the local cubic-order coupling <?n£0 3 [100] - Understanding these results will 
clearly require a better theoretical modeling of halo clustering. 

4-3.3. Redshift distortions Peculiar velocities generate systematic differences between 
the spatial distribution of data in real and redshift space. These redshift distortions 
must be properly taken into account in order to extract f^ L from redshift surveys. On 
the linear scales of interest, the redshift space power spectrum of biased tracers reads 



where P$g and Pg are the density-velocity and velocity divergence power spectra, [i 
is the cosine of the angle between the wavemode k and the line of sight and / is 
the logarithmic derivative of the growth factor. For Pg, the 1-loop correction due to 
primordial NG is identical to Eq. flTBl provided F 2 (k l7 k2) is replaced by the kernel 
G2(ki, k 2 ) = 3/7 + n{k\/k2 + kijk\)j2 + 4/i 2 /7 describing the 2nd order evolution of 
the velocity divergence [58]. For P$g, this correction is 




Again, causality implies that G2(ki,k2) vanishes in the limit ki = — k2. For 
unbiased tracers with b\ = 1, the linear Kaiser relation is thus recovered at large 
scales k < 0.01 /iMpc -1 (see also [61]). For biased tracers, we still expect the 
Kaiser formula to be valid, but the distortion parameter (3 should now be equal to 



as [1551 nm\ 



P s (k,[i) = b\P s {k) +2b 1 fn 2 P SS (k) + f^Pg(k) 



(35) 





(36) 
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Figure 3. Fractional correction to the Gaussian halo bias in the = ±100 and 
Gaussian models. In constrast to Fig. [2] halos were identified with a FOF finder 
of linking length b = 0.2. Only the wavemodes to the left of the vertical line were 
used to fit Ab K (k, /J^l)- The best-fit value of /J?£ and the corresponding lcr error 
is quoted for each model (Figure taken from }166j ). 

P — f/(bi + Ab K ), where Ab K (k, /* L ) is the scale-dependent bias induced by the 
primordial non-Gaussianity. 

4-3.4- Mitigating cosmic variance and shot-noise Because of the finite number of 
large scale wavemodes accessible to a survey, any large scale measurement of the 
power spectrum is limited by the cosmic (or sampling) variance caused by the random 
nature of the wavemodes. For discrete tracers such as galaxies, the shot noise is 
another source of error. For weak primordial NG, the relative error on the power 
spectrum P is <Jp/P ~ 1/VN(1 + a^/P), where N is the number of independent 
modes measured and cr^ is the shot- noise [161] , Under the standard assumption of 
Poisson sampling, er^ equals the inverse of the number density 1/n and causes a 
scale-independent enhancement of the power spectrum. The extent to which one can 
improve the observational limits on the nonlinear will strongly depend on our ability 
to minimize the impact of these two sources of errors. By comparing differently biased 
tracers of the same surveyed volume [1621 1163] and suitably weighting galaxies (by the 
mass of their host halo for instance) |164[ 1165] , it should be possible to circumvent 
these problems and considerably improve the detection level. 

Figure [3] illustrates how the impact of sampling variance on the measurement of 
/jjl can be mitigated. Namely, the data points show the result of taking the ratio 
Ph(k, fNL,)/Ps(k, /Jjl) f° r each set of runs with same Gaussian random seed field <j) 
before averaging over the realisations. This procedure is equivalent to the multi- 
tracers method advocated by [162] . Here, P$ can be thought as mimicking the power 
spectrum of a nearly unbiased tracer of the mass density field with high number 
density. Although, in practical applications, using the dark matter field works better 
|166j . in real data P$ should be replaced by a tracer of the same surveyed volume 
different than the one used to compute P n - Figure [3] also shows that, upon taking 
out most of the cosmic variance, there is some residual noise caused by the discrete 
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nature of the dark matter halos. As shown recently [165] however, weighting the 
halos according to their mass can dramatically reduce the shot noise relative to the 
Poisson expectation, at least when compared against the dark matter. Applying such 
a weighting may thus significantly improve the error on the nonlinear parameter f^£, 
but this should be explored in realistic simulations of galaxies, especially because 
the halo mass M may not be easily measurable from real data |166) . This approach 
undoubtedly deserves further attention as it has the potential to substantially improve 
the extraction of the primordial non-Gaussian signal from galaxy surveys. 

To conclude this Section, we note that, while the PDF of power values P(k) 
has little discriminatory power (for large surveyed volume, it converges towards 
the Rayleigh distribution as a consequence of the central limit theorem) 167 , the 
covariance of power spectrum measurements (which is sensitive to the selection 
function, but also to correlations among the phase of the Fourier modes) may provide 
quantitative limits on certain type of non-Gaussian models |1611 1168] , 

4- 4- Galaxy bispectrum and higher order statistics 

Higher statistics of biased tracers, such as the galaxy bispectrum, are of great interest 
as they are much more sensitive to the shape of the primordial 3-point function than 
the power spectrum Q21 11691 11701 IM1 |4"4"] . Therefore, they could break some of the 
degeneracies affecting the non-Gaussian halo bias (For example, the leading order 
scale-dependent correction to the Gaussian bias induced by the local quadratic and 
cubic coupling are fully degenerate |100j ). 

Normalized cumulants of the galaxy distribution The skewness of the galaxy 
count probability distribution function could provide constraints on the amount of 
non-Gaussianity in the initial conditions. As discussed in fJ3] however, it is difficult 
to disentangle the primordial and gravitational causes of skewness in low redshift 
data unless the initial density field is strongly non-Gaussian. The first analyzes of 
galaxy catalogs in terms of count-in-cells densities all reached the conclusion that the 
skewness (and higher-order moments) of the observed galaxy count PDF is consistent 
with the value induced by gravitational instabilities of initially Gaussian fluctuations 
[501 11711 11721 11731 1511 1174) . Back then however, most of the galaxy samples available 
were not large enough to accurately determine the cumulants Sj at large scales [175] . 
Despite the 2 orders of magnitude increase in surveyed volume, these measurements 
are still sensitive to cosmic variance, i.e. to the presence of massive super-clusters or 
large voids. Nevertheless, the best estimates of the first normalized cumulants Sj of 
the galaxy PDF strongly suggest that high order galaxy correlation functions follow 
the hierarchical scaling predicted by the gravitational clustering of Gaussian ICs |176) . 
There is no evidence for strong non-Gaussianity in the initial density field as might 
by seeded by cosmic strings or textures |177] , 

The genus statistics of constant density surfaces through the galaxy distribution 
measures the relative abundance of low and high density regions as a function of 
the smoothing scale R and, therefore, could also be used as a diagnostic tool for 
primordial non-Gaussianity. While for a Gaussian random field the genus curve (i.e. 
the genus number as a function of the density contrast) is symmetric about 5r = 
regardless the value of R, primordial NG and nonlinear gravitational evolution can 
disrupt this symmetry [178] . The effect of non-Gaussian ICs on the topology of the 
galaxy distribution has been explored in a number of papers [3H1 11791 11801 11811 1182) . 
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For large values of R and a realistic amount of primordial NG, the genus statistics 
can also be expanded in a series whose coefficients are the normalized cumulants 
Sj of the smoothed galaxy density field. Therefore, the genus statistics essentially 
provides another measurement of the (large scale) cumulants of the galaxy distribution 
[I831H84] . 

4-4-2. Galaxy bispectrum Most of the scale-dependence of the primordial n-point 
functions is integrated out in the normalized cumulants, which makes them weakly 
sensitive to primordial NG. However, while the effect of non-Gaussian initial 
conditions, galaxy bias, gravitational instabilities etc. are strongly degenerated in 
the Sj, they imprint distinct signatures in the galaxy bispectrum i?h(ki, k2, ka), an 
accurate measurement of which could thus constrain the shape of the primordial 3- 
point function. 

In the original derivation of |169j . the large scale (unfiltered) galaxy bispectrum 
in the model is given by 

Bhflcx.ka.ka) = &?B (ki,k 2 ,k 3 ) + b\b 2 \P (ki)P (k 2 ) + (eye. 



2b\ 



F 2 (k 1 ,k 2 )Po(k 1 )P Q (k 2 ) + (eye.) . (37) 



Here, b\ and b 2 are the first- and second-order bias parameters that describe galaxy 
biasing relation assumed local and deterministic [185] . The first term in the right- 
hand side is the primordial contribution which, for equilateral configurations and in 
the /jiff model, scales as Mn(k 7 z)" 1 like in the matter bispectrum, Eq . (fl4| . The 
two last terms are the contribution from nonlinear bias and the tree-level correction 
from gravitational instabilities, respectively. They have the smallest signal in squeezed 
configurations. 

As recognized by [1701 [64] . Eq. ([37|) misses an important term that may 
significantly enhance the sensitivity of the galaxy bispectrum to non-Gaussian initial 
conditions. This contribution is sourced by the trispectrum Tr(\s.i, k 2 , k3, kj) of the 
smoothed mass density field, 

1 f eft 

y^3T Ji (k 1 ,k 2 ,q,k 3 -q) + (2perms.), (38) 

and reduces at large scales to the sum of the linearly evolved primordial trispectrum 
To(ki, k 2 , k3, ki) and a coupling between the primordial bispectrum £?o(ki, k 2 , k 3 ) 
(linear in f^ L ) and the second order PT corrections (through the kernel F 2 (ki,k 2 )). 
In the case of local non-Gaussianity and for equilateral configurations, the first piece 
proportional to To scales as (/nl) 2 ^~ 4 times the Gaussian tree-level prediction, with 
the same redshift dependence. Hence, it is similar to the second order correction 
(/nl) 2 M-~r Pr{^) that appears in the halo power spectrum (see Eq |3Tj) . The second 
piece linear in generates a signal at large scales for essentially all triangle shapes 
in the local model as well as in the case of equilateral NG. This second contribution is 
maximized in the squeezed limit (where it is one order of magnitude larger than the 
result obtained by |169] ) which helps disentangling it from the Gaussian terms. Note 
that a strong dependence on triangle shape is also present in other NG scenarios such 
as the x 2 model [55] , 

This newly derived contributions are claimed to lead to more than one order of 
magnitude improvement in certain limits |170) . but it is not yet clear whether these 
gains can be realized in any realistic survey. To accurately predict the constraints that 
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Figure 4. Ratio between the Lyman-a 3D flux power spectrum extracted from 
simulations of Gaussian and non-Gaussian initial conditions at redshift 2 = 2. 
The mean transmission is set to F = 0.8 (Figure taken from 190 ). 



could be achieved with future measurements of the galaxy bispectrum, a comparison 
of these predictions with the halo bispectrum extracted from numerical simulations 
is highly desirable. To date, the only numerical study [22] has measured the halo 
bispectrum for some isosceles triangles (fei = k<z). While the shape dependence is in 
reasonable agreement with the theory, the observed fc-dependence appears to depart 
from the predicted scaling. 

4-5. Intergalactic medium and the Lya forest 

Primordial non-Gaussianity also affects the intergalactic medium (IGM) as a positive 
/jvjL enhances the formation of high-mass halos at early times and, therefore, accelerate 
reionization (1861 11871 [188 . At lower redshift, small box hydrodynamical simulations 
of the Lya forest indicate that non-Gaussian initial conditions could leave a detectable 
signature in the Lya flux PDF, power spectrum and bispectrum [189] . However, while 
differences appear quite pronounced in the high transmissivity tail of the flux PDF 
(i.e. in underdense regions), the Lya ID flux power spectrum seems little affected. 
Given the small box size of these hydrodynamical simulations, it is worth exploring 
the effect in large N-body cosmological simulations using a semi-analytic modeling 
of the Lya forest [190] . Figure |4] shows the imprint of local type NG on the Lya 
3D flux power spectrum (which is not affected by projection effects) extracted from 
a series of large simulations at z = 2. The Lya transmitted flux is calculated in the 
Gunn-Peterson approximation [191 . A clear signature similar to the non-Gaussian 
halo bias can be seen and, as expected, it is of opposite sign since the Lya forest is 
anti-biased relative to the mass density field (overdensities are mapped onto relatively 
low flux transmission). A detection of this effect, although challenging in particular 
because of continuum uncertainties, could be feasible with future data sets. The Lya 
could thus provide interesting information on the non- Gaussian signal over a range of 
scale and redshift not easily accessible to galaxy and CMB observations [1891 1190) . 
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5. Current limits and prospects 

As the importance of primordial non-Gaussianity relative to the non-Gaussianity 
induced by gravitational clustering and galaxy bias increases towards high redshift, 
the optimal strategy to constrain the nonlinear coupling parameter(s) with LSS is to 
use large scale, high-redshift observations [34] . 

5.1. Existing constraints on primordial NG 

The non-Gaussian halo bias presently is the only LSS method that provides a robust 
limit on the magnitude of a primordial 3-point function of the local shape. It is a 
broadband effect that can be easily measured with photometric redshifts. The authors 
of [148] have applied Eq. (|24|) to constrain the value of J^l using a compilation of large- 
scale clustering data. Their constraint arise mostly from the QSO sample at median 
redshift z = 1.8, which covers a large comoving volume and is highly biased, b\ = 2.7. 
They obtain 

- 29 < f^l < +69 (39) 

at 95% confidence level. These limits are competitive with those from CMB 
measurements, —10 < /Jj£ < +74 |192j . It is straightforward to translate this 2- 
cr limit into a constraint on the cubic order coupling <?Jj£ since the non- Gaussian 
scale-dependent bias Ab K (k, g l ^l) has the same functional form as Ab K (k, |100) . 
Assuming — 0, one obtains 

- 3.5 x 10 5 < g^l < +8.2 x 10 5 . (40) 

These limits are comparable with those inferred from an analysis of CMB data using 
n-point distribution functions, -5.6 x 10 5 < .g}j£ < 6.4 x 10 5 |193j . 

Measurements of the galaxy bispectrum in several redshift catalogs have shown 
evidence for a configuration shape dependence in agreement with that predicted from 
gravitational instability, ruling out \ 2 initial conditions at the 95% C.L. |1941 1195] , 
Recent analyses of the SDSS LRGs catalogue indicate that the shape dependence of 
the reduced 3-point correlation Q3 ~ £3/ (£2) 2 is also consistent with Gaussian ICs 
[196 , although a primordial (hierarchical) non-Gaussian contribution in the range 
Q3 ~ 0.5 — 3 cannot be ruled out |197j . Other LSS probes of primordial non- 
Gaussianity, such as the abundance of massive clusters, are still too affected by 
systematics to furnish tight constraints on the shape and magnitude of a primordial 
3-point function, although the observation of a handful of unexpectedly massive high- 
redshift clusters has been interpreted as evidence of a substantial degree of primordial 

ng pBings irann). 

5.2. Future prospects 

Improving the current limits will further constrain the physical mechanisms for the 
generation of cosmological perturbations. 

The non-Gaussian halo bias also leaves a signature in cross-correlation statistics 
of weak cosmic shear (galaxy-galaxy and galaxy-CMB) [201(1202] and in the integrated 
Sachs- Wolfe (ISW) effect |148l 1149] . Measurements of the lensing bispectrum could 
also constrain a number of non-Gaussian models |203j . However, galaxy clustering will 
undoubtedly offer the most promising LSS diagnostic of primordial non-Gaussianity. 
The detectability of a local primordial bispectrum has been assessed in a series of 
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papers. It is expected that future all-sky galaxy surveys will achieve constraints 
of the order of A/^l ~ 1 assuming all systematics are reasonably under control 
US! [Hi HH QUI 0311 US mi HOI] . Realistic models of cubic type non-Gaussianity 
|100j . modifications of the initial vacuum state or horizon-scale GR corrections [152 
should also be tested with future measurement of the galaxy power spectrum. 

Upcoming observations of high redshift clusters will provide increased leverage 
on measurement of primordial non-Gaussianity with abundances and possibly put 
limits on any nonlinear parameter at the level of a few tens [127] . Combining 
the information provided by the evolution of the mass function and power spectrum 
of galaxy clusters can yield constraints with a precision A/^l ~ 10 for a wide field 
survey covering half of the sky |200] . Alternatively, using the full covariance of cluster 
counts (which is sensitive to the non-Gaussian halo bias) can furnish constraints of 
A/jJl ^1 — 5 for a Dark Energy Survey- type experiment |207[ 1208] . 

As emphasized in 21 however, the exact magnitude of the non-Gaussian halo bias 
is still uncertain at the ~20% level, partly due to the freedom at the definition of 
the halo mass. Understanding this type of systematics will be crucial to set reliable 
constraints on a primordial non-Gaussian component. To fully exploit the potential of 
future galaxy surveys, it will also be essential to extend the theoretical and numerical 
analyses to other bispectrum shapes than the local template used so far. Ultimately, 
the gain that can be achieved will critically depend on our ability to minimize the 
impact of sampling variance and shot-noise. In this regards, multi-tracers methods 
combined with optimal weighting schemes should deserve further attention as they 
hold the promise to become the most accurate method to extract the primordial non- 
Gaussian signal from galaxy surveys [162, 164, 11631 1165] . 
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